A hybrid approach to automatic segmentation and labeling for Mandarin Chinese speech corpus
نویسندگان
چکیده
In this paper, we propose a hybrid approach to refine the phonetic boundaries in a Mandarin speech corpus. This approach employs different sets of acoustic features for different categories of phonetic transitions, except for the most difficult case of “periodic voiced + periodic voiced”, which is therefore handled by a heuristic scheme. Several experiments are designed to demonstrate the feasibility of the proposed approach.
منابع مشابه
Towards A Phoneme Labeled Mandarin Chinese Speech Corpus
Phoneme level transcription of speech corpora is crucial to fundamental speech research and the increasingly interested detection-based automatic speech recognition. Currently, there is no existing phoneme-labeled Mandarin Chinese speech corpus. This paper presents our recent work towards development of such a corpus. Our goal is to label five hours of speech data selected from a Mandarin Chine...
متن کاملAutomatic Segmentation and Labeling for Mandarin Chinese Speech Corpus for Concatenation-based TTS
Corpus for Concatenation-based TTS Cheng-Yuan Lin, Jyh-Shing Roger Jang, Kuan-Ting Chen Multimedia Information Retrieval Laboratory Dept. of Computer Science National Tsing Hua University HsingChu, Taiwan +88635715131-3506 {gavins, jang, marco}@wayne.cs.nthu.edu.tw ABSTRACT Precise phone/syllable boundary labeling of utterances in a speech corpus plays an important role in constructing corpus-b...
متن کاملAutomatic Segmentation and Labeling for Mandarin Chinese Speech Corpora for Concatenation-based TTS
Precise phone/syllable boundary labeling of the utterances in a speech corpus plays an important role in constructing a corpus-based TTS (text-to-speech) system. However, automatic labeling based on Viterbi forced alignment does not always produce satisfactory results. Moreover, a suitable labeling method for one language does not necessarily produce desirable results for another language. Henc...
متن کاملAutomatic prosodic break labeling for Mandarin Chinese speech data
For corpus-based speech synthesis, large quantities of labeled speech are required. Manually labeling speech data is quite labor-intensive. Therefore, automatic speech labeling is highly desired. Prosodic break detection is one of the tasks for automatic speech labeling. In the paper, we propose an automatic break detection algorithm for mandarin Chinese speech. In this approach, we use energy ...
متن کاملAutomatic Prosodic Break Lab Chinese Speech
For corpus-based speech synthesis, large quantities of labeled speech are required. Manually labeling speech data is quite laborintensive. Therefore, automatic speech labeling is highly desired. Prosodic break detection is one of the tasks for automatic speech labeling. In the paper, we propose an automatic break detection algorithm for mandarin Chinese speech. In this approach, we use energy c...
متن کامل